Hybrid Text Regression Model for Predicting Review Helpfulness BI Congress 3 : Driving Innovation through Big Data
نویسندگان
چکیده
Business intelligence and analytics are playing an increasingly prominent role in many organizations. User-generated content and online social media open up new opportunities for businesses that can exploit and innovate with this new source of Web 2.0 data. In this paper, we concentrate on one important application – predicting the helpfulness of online customer reviews. We frame it as a regression problem and apply text mining techniques. We propose a hybrid feature selection approach, which combines a filter with a wrapper, for a BI text regression problem. Based on online review data collected from Amazon.com, we demonstrate empirically that the hybrid approach produces the best prediction results among all the models examined. This study is the first to develop and validate a hybrid feature selection methodology for text regression in a BI context.
منابع مشابه
CSE 255 Assignment 1: Helpfulness in Amazon Reviews
In this paper we consider models for predicting the helpfulness rating of Amazon book reviews. We examine features such as the review’s star rating, the length of the review text, the readability of the review text, and the amount of comparisons made in the review. We compare Support Vector Machine and Random Forests models both for regression and classification.
متن کاملLearn To Rate Fine Food
We investigate a food review dataset from Amazon with more than 500000 instances, and utilize information from the data set such as the text review, score, helpfulness, etc. Instead of the traditional word representation using frequency, we use skip-gram to train our own word vectors using the pretrained GloVe Twitter word vector as the initialized value. We also use recursive parsing tree to t...
متن کاملFrom abstract to impact in cardiovascular research: factors predicting publication and citation
AIMS Through a 4-year follow-up of the abstracts submitted to the European Society of Cardiology Congress in 2006, we aimed at identifying factors predicting high-quality research, appraising the quality of the peer review and editorial processes, and thereby revealing potential ways to improve future research, peer review, and editorial work. METHODS AND RESULTS All abstracts submitted in 2006...
متن کاملIRHM: Inclusive Review Helpfulness Model for Review Helpfulness Prediction in E-commerce Platform
Online reviews have become essential aspect in E-commerce platforms due to its role for assisting customers’ buying choices. Furthermore, the most helpful reviews that have some attributes are support customers buying decision; therefore, there is needs for investigating what are the attributes that increase the Review Helpfulness (RH). This research paper proposed novel model called inclusive ...
متن کاملPredicting The Helpfulness Of Online Product Reviewers: A Data Mining Approach
The purpose of this study is to propose a data mining approach to predict the helpfulness scores of online product reviewers. Such prediction can facilitate consumers to judge whether to believe or disbelieve reviews written by different reviewers and can help e-stores or third-party product review websites to target and retain quality reviewers. In this study, we identify eight independent var...
متن کامل